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(54) Tide: TRICK PLAY SIGNAL GENERATION FOR A DIGITAL VIDEO RECORDER 
(57) Abstract 

Trick play speed +4 



The invention 

to the generation of a trick 
play information stream from 
a normal play Information 
stream, so that they can 
be recorded together as a 
composite information stream 
on the record carrier, such 
that upon reproduction in 
a trick play reproduction 
mode, an information signal 
of sufficient quality, eg. 
as regards visibility, can 
be obtained. One aspect of 
the invention lies in the 
generation of GOPs, each 
GOP comprising an I-frame 
retrieved from the original 
normal play information 
stream, and one or more 
so-called "empty P frames". 
Another aspect of the 
invention is the requirement 
of generating -GOPs- for the 
trick play Information stream 
that have a constant bitcost 
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Trick play signal generation for a digital video recorder. 



si*™, rf ™ ti ° nrelatet ^^ 

IS TJH ^^^^^^^^no^in^p^^: 
s^eed.sa.dtnckplay.gnalbeingn.eantforareproduction^ 

method of recording such trick play signal and to a record carrier. 

An apparatus as defined in the opening paragraph is known from published 
"ts^^X PateDt ap P^ cat * on WO 95/28061 (PHN 14832), which corresponds to US-A 
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over,™ J? T" ° f ^ teleVto< "' hM m ° f »— •» 
evermore than three deCKie5 . ^ ^ h _ B0 fcr ^ ^ 

l^M^^h^h^ of video and associated audio data. The first MPEG publication dated April 
.992 whrch resulted m d» mtroductirm of MPEO-1. This system was designed to reduTth, 

has teen developed. This technology bus heen selected to be used in Digital Video 

ZTT^™'™ ta * »^ *> »-■* «-«. *-* video a. _ accept 
U~ b,, mto. Tuts e^ble, tite customer to receive studio quality at his own place. In order to 
eastern, cuaomer to store a DVB program with Audio quality, , dlgtod video seconder is 
rurotred. Smoe two yMr8 ^ vidK , ^ ^ ^ ^ 

for tit. consumer market Both systems make use of dedicated video hi. rote reduction 
^tology which is no, compatib,, wnh toe comprosai™ ..Cmotogy tha, U used in MPEG-2. 
In oroer to store a selected DVB prognun and mahttainlng me high quality, a tianspamn, 
-Moroer ts required. Although a stands* haa bee. denned to atoro MPEG onn DV recoroe, 
up to now no equipment has l«e„ produced mat support mis ahmd.ro, A popuhu consumer ' 
and'roW 81 ^^ S *° I8 ^ e l ^ ev t ce i® tite VHS based video recorder. This system is capable of storing 

^evu^analognumoviauaiprogroms.Toenab.estomgeofdigitolte.eviri™ 
dtgttol extension * cununtly developed to enable the consumer to store and retrieve DvT 
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program, Tne currently developed standard describes the record and playback aspect of this 
system. Not yet included in this standard is how to oreformw., , ,/ 
track sew WE .™ f t\ \Tua » P Play - Trick P Ifl y based on ^ 

track select system forD-VHS MPEG-2 STD mode format will be described in this report 

format TM * 17 ^ 1996aneWStandarihasb — eleased,D-VHS MPEG2STDmode 
format ^ita! version of the VHS recorder family is capable of recording DVB s ig na,s at 
u P to ^IB.SMbits/s.Thestandard.asit is ^^^^^ZZt 

Pro P os ^for™^g^ 

In accordance with the invention, the apparatus for recording a digital video 
mfonnanon s lg nal on a record carrier, the apparatus comprising 

- mput means for receiving the digital video information signal 

-tricky play signal generating means for generating a trick play ^ from ^ ^ video 
reproduction speed, where mis an integer larger than 1, 

- merging means for merging said digital video information signal and said trick play signal 
into a composite information signal, 7 ^ 

said tack play signal generating means being adapted to 

(a) retrieve intra encoded pictures from said digital video information signal, 

(b) generate inter encoded pictures, 

(c) me*, said inna encoded pieteee and ,aid inter encoded picnnes, so as to M . ^ 
P* s.gna. eon^rising sueseqoeo, groops of pfe^, „ ^ ^ 
fono.edby.n^ofoof^g^^^^^^^^ • 
^ toan 0, ft. general tater eroded pieces being snch that, upo „ faction « ssid 
mole speod, an in,er encoded picpn-e following an tan, eroded picnue resalts, upon 
decodtng, m a Rested presentaion of a pictae obtttaed front eroding Si 0d tara encoded 
picture. 

nri-h e ™ ei " M ^^ Pi ^ SMUldbetaaefc ""°ftanal^e«^edpicnnes, 

<m» of tntafnnne encoded pieces, or in the form of taerfieid encoded pieces. ,„ fc 
W ^^P^i«»ffloe M s» ns d teUl e toaB ^ pictllreSMtoaieft)im 
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frame encoded pictures and that the inter encoded pictures are in th. * 

encoded pictures. 016 form of "»terframe _ 

The invention is based on the following re™*™*; — . 
P>av infoimation signal is based on the retrieval T ^ "* 

. Picntres in attendance ^ ^ ^ f ~° T """^ ""* 28 1 

those intra Sam. encoded picntres fa the nick nto, toTT Bmsmis *° -* 

-• r ^.enoT:rZr::i;~^rrT A8 

posstble fa reaife. , trick phy tafo^ d f °° W 

A further aspect of the invention is that GOPs are eenm-atoH e .u ■ 

=rrrr~r =£=s. 

play information signal. More specifically an int™ fr, . , 

^^danonuterosAcJ^^^ 

in the trick oiav inform «^ • , «c reinevea therefrom and used 

racK P la y information signal as the DC coeffinWc *v 

nnmber of AC coefflcients ta th„ pfann, is ,„ . " ' **"t 
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The number of AC coefflcie*. ^ for a sobpk*, of. specfflc 
encoded pen™ of the hick p.ay signal ttopentla o» the diffemace Ween ^ Dc 
coefficiems, thc»e DC coefficient being fte pc - <h> ^ ^ 

prowoaa eubpicmre of the specific inbalhmre mcod ed picture to a» W ck p.a, sigaal 
„■<*„, Th6 " f r 1ProCe!Sta8 0f tow -« 1 rt^Phy win bedescribed. Low-end 

video matenal to create video trick play. 

™ e ™ dMte *^^P^insta»c<»rdance„id I d.ain„ enli o„ resl d (s 
.aaiowtadwarac^i^.F^^^^^^^^^^ 

2^-^^Fora, lt bo S aa^, e c^ W o Mbp ^^ calbeiisedto 

ptctn^The second pammemr « .be temporal reueeh ^ at which ^ ^ 
^p^^iow^^^^^,^^ 

w.da^ofmo.piayaigoaipn,^^,^^^^ 

used to implement tnck play. The trick play system used in this report is baaed on back select 



^^^^n'^nwer.don.d.lb.appamn.a^^^,,^ 
^^^■•nnmenmodinnmtsde^h^ 

• n ^/^^^'"^^^^^^win-twoheadaA^tdBand 
m figure lb the tracks on tape with different azimuth. 

flBTO2 ^ iM *»'»I»»««f<l.«tbe« J aduri. lsrc p ro d 0cli0n , wherefi 
2. ah™ fa scan pah during uomta. play and ftgme 2 b ahowa the scau path for trick pby 
with speed equal to four times normal play 

figure 3 shows the contents of two consecutive sync blocks, 
figure 4 shows the relation between PAT and PMT packet, ' 
figure 5 shows the temporal sub-sampling of a PCM video sequence, where 

tnckplaywxthaspeedpmontheP 

vtdeo tnck play with a trick play speed of +4 times the normal play speed, 

# . _ fi ^ e6sh ° WsAetm P° ra,rab - sa ^gofanMPEGenc^^ 
wtth a GOP length N=12 and M=4, where figure 6a shows the normal play video in MPEG 
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format, figure 6b shows trick play with a speed of +4 on the MPEG pictures and figure 6c 
shows the video trick play with a trick play speed of + 4 times the normal play speed 

figure 7 shows the temporal sub-sampling of a MPEG encoded video sequence 
w.th a GOP length N-12 and M=3. where figure 7a shows the normal play video in MPEG 
format, figure 7b shows the trick play on MPEG pictures and figure 7c shows the trick play 
video with speed is + 4 times normal play speed, 

figure 8 shows the scan pattern of DCT blocks within a shoe, 
figure 9 shows the filling degree of the transcoder I-frame buffer, 
figure 10 shows the flowchart for the I-frame transcoding buffer regulation 
figure 1 1 indicates the relation between the pictures for the different trick play 
tape speeds, * 7 

figure 12 shows the extraction trick play information for higher video trick play 

speeds from the trick play signal for the +4 times trick play speed, 

figure 13 shows the generation of the trick play information for a reverse trick 

play speed by swapping the GOP of the forward trick play speed, 

figure 14 shows the GOP layout at transport stream level, 

figure 15 shows a block diagram of the trick play signal processing, 

figure 16 shows the decoding and presentation time stamps for 25 Hz frame rate 

on the system time axis, 

figure 17 shows in figure 17a a transport stream without jitter on a PCR packet 
and in figure 17b a transport stream which has jitter on a PCR packet, 

figure 18 shows the manipulation of distance of succeeding transport stream 

packets, and 

figure 19 shows the recording apparatus in accordance with the invention. 



In the following figure description, the invention will be described in the form 
of an example where the intra frame encoded pictures are in the form of I pictures as encoded 
m accordance with the MPEG encoding standard, and where the interframe encoded pictures 
are in the form of P pictures as encoded in accordance with the MPEG encoding standard. 

The general principles of track select trick play will be first described. D-VHS 
like its analog counter part, is a helical scan recorder. Ihis means that the information is 
wntten on tape by means of a scanner which is positioned under a angle with respect to the 
longitudinal direction of the tape. The D-VHS scanner used for the MPEG-2 STD mode has 
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two heads A and B with different azimuth. Figure la indicates the nositi , . 

tape, obtained duringrecording, using the above head conngur^ " ^ 

^ngnonnal play reproduction, those two heads read toft™, ^ 
5 m such a way , that head A read the tracks written untv> u „ ,nfo ""ation from tape 

procedure is valid for head B During IT A ^ The same 

ror head B. During trick play reproduction, the heads A and R 
different scan path with resn^t «. md B .°ave a 

^ P^ at* figure 2b sh™ a,, " fer * «» — tuning 
normal play. a«n path for tnckplaywm, speed w fcur ^ 

Track select trick play is based on the fact that head A ann n 
ttaemdned tracks. When soch a ft . head A and B crosses pre- 

tape in snch a way tha, this date L^TTl' " " i "' bnMto « 

P-aya— deSned^i^lkT * ^'^ ^ Fot apecinc «ck 

^d»,arede^ p TZn;T* A ^ foOT " WWCh ^«*^ 
P^is-cscHc^r^ 

09/.3547 (PHN , S2 mXIc 7^ °f applicant, such as US ser. no. 

-^p^xt^T^r^rr 1898 ^ 

play speeds, it can be concluded that the values J^jand q aTdeflnedto th7 

and 2, in the present example. * e cUms - 3 

apphcadons, indict ma, J^, ^ —~ P— 

*w ouiouni oi aata, that is read durine one revAl»H/*« 

— " "-.ttckp^spe* of da^CilTLl™!,*: ■ 
one revolution of the scanner ftr™ retneved, dunng 

.ynob.ockswiUconjnre^rT'? ^ ,0 ^ Bd ^ dtaB9W »' ^ 
ayncblocks ,„ ^XtTt T C " teC, '' M ' ^ " M * «— «« 

t^paAetsoSltrensportstrenmpacketsarestoredpcrrevolution Thetis, 
only used to store oavload snm. „r.t v revolution. The 1 12 bytes are not 

■-•=:~-=r~rrr~-=. 
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correct erroneous syncblocks. A part of the system information is necessary to distinguish 
between normal play syncblocks, dummy syncblocks (stuffing) and trick play syncblocks ' 
whereby even a distinction is made for the different trick play speeds. For each syncblock, this 
^formation is written in the main header. The first syncblock contains beside the first part of a 
transport stream packet, also contains a timestamp value, the packet header. This is a four byte 
field where information is stored which is necessary for the play back side of the system in 
order to reconstruct the original timing of the successive transport stream packets. Figure 3 
shows the two consecutive syncblocks that carry one transport stream packet. 

From the amount of payload syncblocks that are read per revolution the 
channel bit rate can be calculated. D-VHS MPEG-2 STD mode supports two scanner 
revolutions. The first scanner revolution is 30 Hz, the second scanner revolution is 
30*(1000/1001) « 29,97 Hz. For both situations, the channel bit rate has been calculated and is 
available in table L 



Scanner revolution 


Trick play channel bit rate (bits/s) 


30 Hz 


2301120 


29,97 Hz 


2298821.17 



Table 1 : Channel bit rate for 30 Hz and 29,97 Hz drum frequency. 

The bit rates from table 1.0 are the maximum bit rates that can be used to generate a video 
trick play stream at transport stream level. 

The MPEG compressed video trick play information will be embedded in a 
transport stream which must fit in the trick play bandwidth as calculated above. In order to 
generate video trick play based on the normal play video information, the pictures must be 
extracted from the normal play video stream. The description below wilLdeal with the 
different processing steps required to generate trick play from a received DVB stream. The 
two main processing steps are: 



■ Transport stream demultiplexing 
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■ Video elementary stream processing 

The ™Wpl« opendton „ Wch ^ ^ ^ ^ ^ > 

stream will be diseased in ctap«r Sree. The reuon f„ r m „ ^ ta 

— • i • P« of .he bit «e, and does no, add anytotag to «he video h «, ^ 

resolution and the temporal refresh rate. 

Audio visual information that is broadcasted by DVB, makes use of a transport 
stream layer. This layer is configurated in such a way that packets, with a fixed length of 188 
bytes, carry beside audio visual information also data like videotext and Program Specific 

mformaUon (PSI) from the provider to the end-user. For the tra^ 
been denned: 



■ DVB-S 

■ DVB-C 
15 BDVB-T 
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25 



30 



the extension S, C and T stands for Satellite, Cable and Terrestrial respectively Each 
transmission layer is optimized conform its own channel. At the decoder side, the output of the 
channel decoder is a transport stream Normally this transport stream cames N programs 
After selecting one or more programs, which selection is necessary because the recording 
channel rate is lower than the transmission channel rate of the transport stream, a recording 
operation is possible. In order to generate video trickplay based on the recorded program 
special signal processing is required. The first step is to extract the video elementary stream 
out of the transport stream multiplex. This operation is done by the demultiplexer. 

Video data that is transported in a transport stream, is multiplexed together with 
other ^formation such as audio, videotext and PSI. At the receiver side, a program is 
demultiplexed in such a way that all the data of the same type such as video, audio etc are 
separated from the multiplexed stream. The way to demultiplex program information, is 
earned m the transport stream. Two tables, Program Association Table (PAT) and Program 
Map , Table (PUT) carry information which enables a transport stream decoder to retrieve all 
the ^formation for one program from the multiplexed transport stream. This process is 
described in detail in ISO/IEC 13818-1. After retrieving the video data from the multiplexed 
transport stream, elementary stream processing can be performed on the extracted video 
elementary stream. Figure 4 shows the relation between the PAT and the PMT packet 
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respectively. The PAT packet contains all the available programs in the multiplexed transport 
stream. Each program number has an associated program map PID. This program map PID 
refers to the PMT packet which contains all the PID values that build up one program. This 
PMT table indicates which packet contains the video information. This is done by means of 
the stream type identifier and the corresponding elementary PID value. 

Low-end video trick play is based on re-use of pre-encoded video material. In 
case of DVB programs, this means pre-encoded MPEG video. Video trick play, can be treated 
as a normal play video signal, that is sub-sampled in the temporal direction. Practically mis 
means that only some pictures in the temporal direction are of interest The coarseness of the 
sub-sample process, the amount of pictures that are skipped, depends on the trick play speed. 
If a video sequence is available in the PCM domain then the following graphical 
representation can be made. Figure 5 contains three sketches. Figure 5a indicates the pictures 
of a normal play stream on the time axis. Figure 5b shows the same time axis as in figure 5a 
whereby the pictures that form the video contents of a trick play video sequence, with a speed 
of four times normal play speed, are dark coloured, while the pictures that are skipped are 
transparent. Finally figure 5c contains those pictures from a normal play sequence mat form a 
trick play sequence which corresponds to four times normal play speed. 

The process depicted in figure 5 can also be performed on MPEG pre-encoded 
video. Figure 6 indicates this process. Figure 6a shows a MPEG encoded normal play video 
sequence with N=l 2 and M=4. Hereby is N the length of a GOP and M is the P-frame 
distance. Figure 6b indicates the sub-sample process for a GOP structure with N=12 and M=4. 
The dark coloured pictures from figure 6b are selected to form the video trick play sequence. 
The transparent pictures are skipped. The filtered pictures form a video trick play sequence, 
see figure 6c. This video sequence does not only contain the pictures that corresponds to the 
trick play speed, they even form a valid MPEG stream due to the fact that the motion 
estimation done at the encoder side makes use of the selected pictures. This enables the 
decoder to correctly reconstruct the encoded motion compensated pictures. This last point is 
important because this will normally not be the case. 

Figure 7 indicates the same process but the GOP has a different structure N=l 2 
and M=3. When a video trick play stream is extracted from this GOP structure, whereby the 
filtered pictures corresponds to those that build up a video trick play sequence for four times 
normal play speed, the temporal MPEG reference is corrupted. 

From the previous two examples it can be concluded that only the intra frame 
coded pictures can be re-used for trick play. The reason for this statement is two fold. First, 
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these frames can be standalone decoded, no fixture or past picture information is necessary 
Second, the pictures contain beside the interlace effect, no temporal information. The interlace 
effect does only occur, when the original scene is interlaced. If the original video scene is 
progressive, for eg. film material, then there is no interlace effect when I-frames are repeated. 

5 The elementary stream video processing has as task, to generate a valid video 

MPEG elementary stream that can be multiplexed into a MPEG transport stream. The video 
elementary stream has two main parameters that must meet specific requirements. The first 
parameter is the overall bit rate that will be used for the trick play video elementary stream. 
The second parameter is the frame rate of the video elementary stream. This last parameter 

10 depends on the continent where the trick play stream is generated. A distinction can be made 
between continents that support 25 Hz frame rate and those that use 29,29 Hz or 30 Hz frame 
rate. 

It has been stated before that for video trick play based on video extraction from 
a MPEG pre-encoded program, only the intra frame encoded pictures can be used. The bit cost 
15 of the intra frame encoded pictures depend, beside the size of the picture, strongly on the 
overall bit rate, that has been used to encode the normal play video elementary stream 
sequence. For the video elementary stream bit rate a simple expression can be used, that 
defines the relation between the video elementary stream bit rate and the bit cost per picture. 
In case of a fixed bit cost per picture the overall bit rate will be equal to equation 1 . 



20 



Video bitrate - frame rate «. frame bitcost (1) 



An MPEG encoded video sequence will normally not have a fixed bit cost per 
picture. Intra frame encoded pictures will have a bit cost that is larger than the bit cost that is 
25 used for motion compensated pictures, such as P- and B pictures. In general MPEG intra frame 
encoded picture requires a transmission time that is larger than one display frame period. For 
typical I-frame bit cost values see table 2, and table 3 for the required transmission times. For 
encoding parameters see table 4. 
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sequences 



Table 2: Typical I-frame bit cost value found 



These values depend heavily on: 

■ The used MPEG encoder 

■ The used GOP structure 

■ The used picture size 



10 



Normal play video 
sequence 


Average I-frame 
Transmission time 
(ms) 


Minimum I-frame 
Transmission time 
(ms) 


Maximum I-frame 
Transmission time 
(ms) 


HARLEY 


81.9 


45.8 


116.9 


BARB WIRE 


82.6 


13.5 


166.0 


NEDERLAND-2 


83.5 


13.6 


128.0 


GIRLS 


72.2 


56.4 


113.7 
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Table 3: Typical normal play I-frame transmission times. 



Video sequence: 


Normal play video sequence 
encoding parameters 




GOP parameters 


Bit rate 
Mbits/s 


Amount 
of pels 
per line 


Amount 
oflines 
per frame 




M 


N 








HARLEY ' 


3 


12 


9.4 ' 


720 


576 


BARB WIRE 


3 


12 


3.4 


528 


576 


NEDERLAND-2 


1 


12 


5.0 


544 


576 


GIRLS 


3 


12 


8.0 


720 


576 1 



Table 4: Encoding parameters for the video sequences: Harley, Barbwire, 
Nederland-2 and Girls. 



With aid of the values from table 2, some rough calculations can be preformed 
on the average bit rates per I-frame for a GOP structure N-l. Table 5 contains the 
transmission bit rates for the I-frames from table 2. From the values depicted in table 5, it 
.should become clear, that generating a video trick play sequence with a GOP containing only 
the selected I-frame, and maintaining a frame rate of 25 Hz, requires a high trick play channel . 
bit rate. Due to the feet, that intra frame encoded pictures are selected from the normal play 
video sequence, the peek bit rate will requires huge channel bit rates values, except for the 
BARBWIRE video sequence, and in some cases the required bandwidth is even higher than 
the maximum bit rate allowed in some MPEG applications. For this reason video trick play 
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based on I-fiame extraction wil, not be possible without extra signal processing. There are 
some methods that can be used to reduce the bandwidth problem and still obey the name rate 
constrain. 

A method that is relatively cheap to implement, is to insert so called empty P- 
frames. Empty P-frame are pictures that force the decoder to display an exact copy of the 
previous decoded picture. Because no extra ^formation is required by the decoder, the P- 
fhunemustonlytraw^ 

and the last macro block of a slice needs to be transmitted. As a result the empty P-name bit 
cost ,s very small. This signal processing step lowers the perceived refresh rate, but creates 
transmission time for the relative large I-fiame, Disadvantage of this method is, that the 
Picture refresh rate will be reduced, whereby the picture refresh rate can be reduced up to one 
P^ture per second, but with a spatial resolution equal to that of the original I-frame. A better 
method 1S to reduce the resolution of each intra frame encoded picture. This method will 
mcrease the picture refresh rate, but at the same nme reduce the spatial picture quality. Beside 
the lower spatial resolution extra hardware complexity is added to the video trick play signal 
processing system. 

* 



Normal play video 
sequence 


Average video bit 
rate per frame for 25 
Hz frame rate. 
(Bits/s) 


Minimum video bit 
rate per frame for 
25 Hz frame rate. 
(Bits/s) 


Maximum video bit 
rate per frame for 
25 Hz frame rate. 
(Bits/s) 


HARLEY 


19.252.100 


10.772.400 


27.492.400 


BARB WIRE 


7.028.150 


1.149.600 


14.114.200 


NEDERLAND-2 


10.445.475 


1.708.600 


16.006.100 


GIRLS 


14.450.800 


.11.290.400 


22.746.200 



Table 5: Video elementary stream trick play bit rates, for N-l GOP length, based on I-frame 
20 selection from a normal play video sequence 
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« — . . « FOrtoCkpIayb ^° nno ^l )la y^es e lection,somemeritscanbe 
d^ed^respccttohardwaretapiemen^on.Fi^ " 

^-^a-nnal play video sequence for one speed trie, p^ generationwnibegiven . 

■ I-frame can be selected by parsing the video elementary stream on byte basis 

. . , The ^ IeqUired to 6Xtract °* pictures from the normal play stream 

has low complex^, due to the feet that the stream at picture level is byte based. 

■ High spatial quality, equal to the original I-frame resolution. 

ismaintamed.^^ 

Next, some demerit parameters of normal play I-frame selection for one speed 
tack play generation will be given. 

■ Low picture refresh rate 

Due to large I-frame bit costs and a low bit rate trick play channel the 
tonsmission of one compressed intra frame encoded picture requires more than one frame 
d.splay penod. Due to this, the picture refresh rate is lower than the frame rate. 

■ L-ge picture buffer si Ze required for storage of MPEG compressed I-frame. At least one for 
each speed. 

Due to the fact, that it requires several display frame periods to transmit the 
extracted I-frame, a buffer is required to store the extracted I-frame. 



I Perceived trick-play speed depends on GOP length. 



The amount of I-frames that can be selected, depends on the GOP length N of 
the normal play video stream, if the I-frame refresh time is ,arger than the trick play I-frame 
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transmission time, than the trick play picture refresh is determined by the normal play GOP_ 
length N. In worse case situation N is equal to 1023. 

Next, the bitrate reduction by means of lowering the I-frame resolution will be 
described. Video trick play, based on I-frame selection from an MPEG encoded video 
elementary stream, will have a bit rate that is larger than the bit rate of the original video 
elementary stream, when the I-frames are used to form a new MPEG video sequence, whereby 
the GOP length N , is equal to one, which means I-frame one. The bit rate of such signals 
varies per picture, and can even be higher than the maximum allowed bit rate as defined within 
some MPEG applications. In the above description, a solution was provided based on insertion 
of so called empty P-frames to the video trick play stream in order to lower the required 
channel bit rate. Such a processing works quite well but can considerably lower the picture 
refresh rate, especially when the normal play video stream has a bit rate that is higher than 6 
Mbits/s and has full resolution, this means maximum horizontal- and vertical size. 

For the generation of video trick play, five parameters are important: 

■ Frame rate 

■ Picture bit cost 

■ Picture refresh rate 

■ Spatial resolution 



The first parameter is a constraint that must be obeyed, and is defined by the 
continent where the recording is performed. The second parameter can be calculated by 
removing the transport stream overhead from the trick play channel bit rate. Only the third 
parameter can be modified, which will have a direct influence on the last two parameters. The 
picture refresh rate and the spatial resolution can be treated objectively as well as subjectively. 
Due to the I-frame transcoding, a large part of the picture content is removed. It is for this 
reason that an objective judgement will not be appropriate. A far better method is judgement 
according to subjective criteria. 

------ The only way to increment the picture refresh rate is by lowering the I-frame bit 

cost, this will reduce the required transmission time of an intra frame coded picture. Problem 
by the I-frame bit cost reduction is the fact that an I-frame can not be endlessly reduced. In 
fact, the smallest bit cost is based on the bit cost required to create a spatial resolution whereby 
only the DC value of each DOT block is transmitted. Practically this means that the final bit 



10 
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tnfornrahon depends ^ on ^ ^ ^ 

cos, that — be avenab.e M be Ws ^ m ^ ZXl^Z 

^on. the transport stream overhead can be reduced to 5 % of the total hT2 



Normal play video 
sequence 


Average video bit 
cost (Bits) 


Minimum video bit 
cost (Bits) 


Maximum video bit 
cost (Bits) 


HARLEY 


108326 


83600 


122488 


BARB WIRE 


77329 


53944 


97200 


NEDERLAND-2 


55012 


48032 


60176 


GIRLS 


78915 


75336 


81840 



15 



Table 6: Intra frame transcoded pictures containing DC resolution. 
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frequency. 



Table 7: Elementaiy video bit rate for 30 Hz and 29,97 Hz drum 



The video elementary stream bit rates that are available in table 7, should be 
to transmit 25 Hz, 29,97Hz or 30 Hz video. Table 8 contains the bit cost for each frame 
rate, in case of a fix bit cost per picture, 



Frame rate 


Picture bit cost (bits) 


30 Hz 


72868 


29,97 Hz 


72868 


25 Hz 


87442 



Table 8: 1-frame bit cost, for 25 Hz, 29,97 Hz and 30 Hz frame 



rate. 



The bit cost per picture, depicted in table 8, is not sufficient to transcode 
selected MPEG-2 encoded I-frames in order to reuse them for trick play. This is caused by me 
feet that the required picture bit cost values are less than the maximum values in table 6. The 
only way to make a low-end trick play system work, is by reducing the picture refresh rate. 

Subjectively judged simulations with respect to the minimum required picture 
refresh rate, have shown that a picture should maximally be three times repeated. For a 25 Hz 
frame rate environment this means that the actual picture refresh rate is 8.3 Hz. For the 30 Hz 
and 29.97 Hz frame rate situations, this results in a 10 Hz picture refresh rate. 

By reducing the picture refresh rate, the minimum required temporal refresh 
rate is obeyed. Because of the picture refresh reduction of a factor three, the I-frame bit cost is 
almost tripled in size. Only a small part of the I-frame bit cost must be spend on the empty P- 
frame. For 30 Hz and 29,97 Hz frame rate systems the maximum empty P-frame size is 2800 
bits, whereby the 25 Hz frame rate system, the maximum empty P-frame bit cost is 3328 bits. 
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Frame rate 


I-frame bit cost (bits) 


30 Hz 


^riouut - 30UU = 213004 


29,97 Hz 


218604 - 5600 = 213004 


25 Hz 


262326 - 6656 = 255670 



Table 9. I-frame bit cost, for 25 Hz, 29,97 Hz and 30 Hz frame rate, with a picture refresh 
reduction of a factor 3. 



with DC resolution is strongly based on the statistic behaviour of natural video. Beside the 
video elementary stream overhead it is strongly determined by the bit cost required to 
represent the DC value of the DCT matrix. 

... f ^* de0S ^ UenWSUSe ^ 
bitmtes.Notonly W ebitrateisanimportantparamet^ 

and vertical size, of the used video sequences is important. For this reason, the video 
sequences have chosen is such a way, that different picture sizes form part of the normal play 
video analysis. 

In order to transcode an I-frame, two possible transcoding operations could be applied. 



I Full MPEG decoding up to DCT level, full re-encode at the desired lower bit 
I Selection of runlengthjevel encoded DCT coefficients 



rate 



The first method requires high hardware complexity, but will result in an acceptable up to 
good picture quality. The second method requires modest hardware complexity, and Will result 
m a poor up to good picture quality. Only the second method, is acceptable in case of low-end 
video trick play. 

Next, the bit cost reduction by means of runlength-level encoded DCT AC 
coefficient selection will be described. 
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As ka<*tedabove,alow.em^^ 
ofrunlength level encoded AC coefficient selection. MPEG makes use of a DCT 
transformation to remove spatial correlation before it is visually weighted quantised to 
disregard the less important information in a picture. After quantisation the DCT coefficients 
are scanned, either zigzag or by means of an alternative scan method, and runlength-level 
encoded, m order to reach the runlength-level encoded DCT coefficients, the video elementary 
stream must be parsed, starting at the picture header all the way down to the block layer A 
part of this parsing process can be done on byte basis, after the slice header this process must 
be performed by means of variable length decoding see ISO/EC 13818-2. 

The DCT transformation has the advantage that the important energy that build 
up me 8-pels by 8-lmes data block i^^^ 

means that for eg. with aid of the first 20 AC coefficients, the maximum amount of AC ' 
coefficient per DCTmatrix is 63, themost relevantpart of the 8-pels by 8-lines datablock can 
be reconstructed. A subjective spatial good quality of the picture can be maintained by 
tnmsntitting these 20 AC coefficients. When a large number of AC coefficients are removed 
than the spatial subjective quality can no longer be maintained and visible artefacts are 
mtroduced. This will be the case when for each runlength-level encoded DCT block, only the 
first 2 or 3 AC coefficients are selected and transmitted. The amount of AC coefficients 
available in anl-frameDCTBlock, depend strongly on the bit rate at which the original video 
sequence is encoded, as well as on the contents of the encoded 8-pels by 8-lines data block. 

By means of experiments the consequence of selecting a certain number of 
lower AC coefficients have been studied. Target of this experiment has been, that the resulting 
video elementary stream bit rate must fit into the D-VHS trick play channel. The picture 

re freshmfohasbeenputat8.3Hzandtheframe r ateisequalto25Hz.Table S 10, 11, 12and 
13 contains the results of this study. 
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No. of A( 
compone 


2 coefficients per 
at type 


Normal play video sequence: HARLEY 


Y 


U 


V 


Average bit 
cost 


min 
bit cost 


max . 
bit cost 


averaee bit: rate 

GOP=IPP 

Mbits/s 


2 


2 


2 


202015 


120392 


243144 


1.751592 


3 


2 


2 


219484 


130216 


267824 


1.897179 


4 


2 


2 


245087 


138264 


305064 


2.111010 


5 


2 


2 


271733 


152944 


341760 


2.333940 



Table 10: AC selection vetsus I-frame bit cost and average bit rate, seq. HARLEY. 



No. of AC coefficients per 
component type 


Normal play video sequence: BARB WIRE 


Y 


U 


V 


Average bit 
cost 


min 
bit cost 


max 
bit cost 


average bit rate 

GOP=IPP 

Mbits/s 


15 


10 


10 


200232 


63448 


496664 


1.735113 


18 


12 


12 


208029 


65032 


550376 


1.796715 


20 


15 


15 


214816 


67424 


579280 


1.858316 
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Table 1 1 : AC selection versus I-frame bit cost and average bit rate, seq. BARB WERE 



No. of A< 
compone 


C coefEcie 
nttype 


ntsper 


Normal play video sequence: NEDERLAND-2 


Y 


U 


V 


Average bit 
cost 


min 
bit cost 


max 
bit cost 


average bit rate 

GOP=IPP 

Mbits/s 


10 


10 


10 


237826 


131856 


237826 


2.058140 


12 


10 


10 


253752 


140688 


318552 


2.186047 


15 


10 


10 


339672 


150784 


270292 


2.325582 



12: AC selection versus I-frame bit cost and average bit rate, seq. NEDERLAND-2 



No. of A 
compone 


C coefficie 
nttype 


aits per 


f 

Normal play video sequence: GIRLS 


Y 


U 


V 


Average bit 
cost 


min 
bit cost 


max 
bit cost 


average bit rate 

GOP=D?P 

Mbits/s 


5 


2 


2 


251761 


226728 


291288 


2.200000 


8 


5 


5 ' 


301307 


264992 


363648 


2.600000 


10 


8 


8 


334409 


294664 


412272 


3.000000 



10 
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Table 13: AC selection versus I-frame bit cost and average bit rate, seq. GIRLS 

The video trick play sequence that is generated from those normal play streams 
with N-12 corresponds to a trick play speed, that is equal to four times normal play speed. 
5 Due to the fact, that a temporal picture refresh reduction of a factor three is required in order to 
transmit the transcoded 1-frame, all the normal play video GOP structure whereby N-12, will 
lead to the same trick play video sequence. For all the normal play video sequences which 
have a GOP length of N smaller or equal to twelve, the generated trick play sequence has an 
exact relation with respect to the normal play video sequence. This exact relation will not be 
maintained when the normal play video sequences have a GOP length greater than twelve. 

The video trick play quality depends strongly on the normal play video bit rate, 
as well as on the picture size. For the I-frame transcoding, the human visual system has been ' 
taken into account. For this reason the colour difference signals have been transcoded with less 
runlength-level encoded AC coefficients than the luminance signal. 

For those normal play video sequences which have a lower horizontal size, such 
as BARB WERE and NEDERLAND-2, and also have an acceptable bit rate, the achieved ' 
subjective video quality is ranked acceptable up to good. For the normal play video sequences 
HARLEY and GIRLS, the obtained video trick play quality is ranked poor up to acceptable. 
The reason for this subjective lower video quality is two fold. First, the normal play bit rate is 
20 high, second the horizontal picture size has the maximum values allowed for some MPEG 
applications. 

The subjective quality is strongly influenced by the amount of visible artefacts. 
Two main types of artefacts can be distinguished. First, artefacts due to removing information 
that form fine details. Second, artefacts due to removing information that built up 
discontinuities, such as edges, within the spatial area. This last artefact, has a strong influence 
on foe subjective judgement of foe video trick play sequence. 



15 



25 



From the I-frame transcoding results, the following conclusions can be made. 

30 ■ For normal play MPEG encoded video sequences with bit rates higher than 6 Mbit/s, 
especially for those sequences which have 720 pels per line and 576 lines per frame, strong 
artefacts are introduced because only a few runlength-level encoded AC coefficient can be 
selected, in order to stay within the available bit cost 
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amount of clearly visible artefacts. ** 
Tlus re-unea taowledge of ,he p^ „ J*™ 1 ; 

ance me subjective spatial picture quality for video trick play based on I 
tome extpacnon tap. an MPEG-2 pre-coded video sequence. 

Natural video has tempos as „, U as .paid combltan. MPEG video 

•ubjecave good pKtare quality. For inn. fame encode picpares the DCT oftt^ „. 
benveen tie coeffie.^ ta a tfven Mock and Ut. coefficients of n^tu^b^ ™ s fa 

eq m 2 : lheDCv 1 d M „fu K „ elghb<lunngl>lockjlls(codeiJ 

^cp^enon f „r the DC value in the current b.oc, ^ dilfe^ ^ „ J^se to 



ADC = DC - P 



(2) 



10 



15 



20 
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^ blocks, JrJTctriJT *t, r **- d — ta - * 

DCT DC vsJms wbich «-8-^ t «'ee=. "»e wdl bo a difference between He two succeeding 
m wluol > « not close to wro. t,,, n^j,^ ^ 8 

nt^tode, .so ta. as a*-* DC. is ,0 Mtt> J^^— 
chrominance DC see ISO/TEC 13818-2 table 8 5 

allocated for fcDC^^* 9 ^r^^^™ ,Wafl,l,> * lt,, - b " 
ize value is 9 bits for luminance and 1 0 bits for chrominance 

With aid of this 10 bits differential DC value a tnhi« «. 
The statistical behaviour of the differential tip v. 



>5 



Class one 


Class two 


Class three 


Class four 


differential DC 
value is equal to 0 


0 < differential DC 
value < 5 


5<= diflFerential DC 
value < 15 1 


differential DC 
valued 15 



Tabte 14: Oass definition for aatisdc analysis ofdt. differential DC valne 



„~" " to ^ t0p ^ a ^^^y^dnacollecaonbfobteined 

Fortbea^lvsiaofthe differ DC, adirfcentappro^bbas b« Jd '7*£Z 
before that for normal video seqoences .be,, is spanaI m we „ ffi ."j , 
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^.boamaR^p^^^ ^^^^ 

— :r=sr„ 



Luminance component Y J Chr 0 : 



Class 
1 

(%) 


Class 
2 

(%) 


10 


25 



12 



19 



minance component U 



4 



Table 15: Division of differential DC value according to the 
sequence HARLEY 



Chrominance 


component V 


Class 


Class 


Class 


Class 


I 


2 


3 


4 


(%) 


(%) 


(%) 


(%) 


4 


7 


3 


2 



chosen class definition for video 



Luminance component Y 



Chrominance component U 
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Table 16: Division of differential DC value according to the chosen 
sequence GIRLS' 



PCTAB99/01015 
class definition for vide© 



Luminance component Y 



Chrominance component U 



Chrominance component V 



Class 


Class 


Class 


Class 


Class 


Class 


Class 


Class 


1 


2 


3 


4 


1 


2 


3 


4 


(%) 


(%) 


(%) 


(%) 


(%) 


(%) 




(%) 














(%) 




10 


28 


14 


14 


4 


10 


2 


1 



Class 
1 



Class 

2 
(%) 



10 



Class 
3 

(%) 



Class 
4 

'0/ 



5 Table 17: Division of differential DC value according to the chosen class definition for video 
sequence NEDERLAND-2 



Lunik 


anceco 


mponen 


tY 


Chrominance component U 


Chrominance 


component V 


Class 
1 

(%) 


Class 
2 
(%) 


Class 
3 

(%) 


Class 
4 

(%) 


Class 
1 

(%) 


Class 
2 

(%) 


Class 
3 

(%) 


Class 
4 

(%) 


Class 
1 

(%) 


Class 

2 
(%) 


Class 

3 
(%) 


Class 

4 
(%) 


15 


28 


11 


13 


4 


8 


3 


2 


4 


9 


2 


1 



10 Table 18: Division of differential DC value according to the chosen class definition 
sequence BARB WIRE 



for video 



An algorithm can be developed mat defines how the runlength-level encoded 
AC coefficients can be assigned per DCT block, taking into account the differential DC value 
15 and its statistical occurrence. Beside the assignment algorithm, a buffer regulation algorithm is 
required m order to prevent a I-frame picture buffer overflow during the transcoding process 
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Range of differential DC ~" 


Size 


-2047 to -1024 


11 


-1023 to -512 


10 


-51 1 to -256 


9 


-255 to -128 


8 


-127 to -64 


7 


-63 to -32 


6 


-31 to -16 


5 


-15 to -8 


4 


-7 to -4 " 


3 


-3 to -2 


2 


-1 


1 


6 


0 


1 


1 


2to3 


2 


4to7 


3 


8 to 15 


4 


6to31 


5 


32 to £%\ 


6 


64 to 127 


7 


128 to 255 


8 


256 to 511 


9 


512 to 1023 


10 


1024 to 2048 


11 



Table 19: Variable length codes for differential DC value. 



Next, the assignment of AC coefficients will be further described. The spatial 
resolution of the transcoded I-frame is determined by the number of runlength level encoded 
AC coefficients per DCT block. The bit cost that can be used to transcode the intra frame 
encoded pictures is depicted in table 9. The optimum spatial resolution can be obtained when 

CONFIRMATION COPY 
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this bit cost is completely used by the transcoding process Thi« , , « 
I-^e.be.ausethemaxirnu.bitcostislin,^ 

the empty P-frames, which also have a fixed bi cl ^ ^ t08ether Wi * 

GOP WhentbelfL ^ , ^ ta a fixed bit «> st 

GOP. When the I-frame transcodmg process is not optimally performed, than stuffing can be 
performed, m order to reach the maximum I-frame bit cost. 

The main task of the transcoding algorithm will be two fold First <™ * 
^alreso^^ 

pasture must be transcoded in such away, that its final bit cost is lower or eoLt me 
_ value ^*^9.T>^^,^^££1 

^ofmetranscodingprocessacalculationcanbedonetodef^^ 

^^ationitisne^ary^ttheeiementary ^^fcJL^^T 

^arymsizeduetomefactmatatmeencodersideextr^ 

^eme.tarystream.Adecoderonmeo 

reqmres the followmg headers and its corresponding extensions. 



■ Sequence header 

■ Sequence extension 
20 ■ GOP header 

■ Picture header 

■ Picture coding extension 

■ Quant matrix extension 



25 



30 



2 «t „ , ^ ft ° U ^ ftere * em0rehead ^ 
numbe, of AC coefficena ,„„ coraponfc «, fce dif6renlial Dc 
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guarantee that the bit cost of the transcoded I-frame is equal or lower than the maximum bit 
cost of table 9, a buffer regulation is required. For the transcoding process two parameters are 
used for the buffer regulation. 

5 ■ Running slice bit cost 

■ Running frame bit cost 

The running slice bit cost, keeps track of the amount of bits used for the current 
transcoded I-frame slice. The running frame bit cost, keeps track of the total amount of bits 
10 spend on the up to then transcoded slices. A graphical representation can be made of the buffer 
filling of the I-frame transcoding process, see figure 9. 

The assignment of the AC coefficients per DCT block is a function of the 
differential DC value and depends on 

15 ■ The normal play bit rate 

■ The normal play picture size 

■ The running slice bit cost counter 

■ The running frame bit cost counter 

■ The frame bit cost difference 

20 

The first two parameters, are responsible for the initialization of the I-frame 
transcoder. With aid of these parameters the number of AC coefficients that will be assigned 
per class to a DCT block are defined. For apossible assignment see tables 10, 1 1, 12 and 13. 

A one time initialization will not be sufficient. The assigned number bits per 
25 slice, is not known before hand. This means that afterwards, a check has to be performed to 
see if the assumptions, as they were done during initialization, were correct, figure 10 
indicates a overall flowchart of a possible transcoding buffer regulation process. 

The flowchart contains functions and its corresponding arguments between 
brackets. The corresponding software model contains a precise description the buffer 
30 «gulation. The main parameters that are controlled by the buffer regulation are the borders of 
the classes as they where defined for the statistical analysis and the number of AC coefficients 
that will be assigned per DCT block. This action is taken at the bottom of the flowchart by the 
•modify^hboarderO...)' and , modify_no_ac_coef( )'blocks, respectively. 
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A further way of data compressing the information in a trick play signal, is by 
way of macroblbck truncation. Macroblock truncation means that one or more macroblocks in 
each slice of a picture, more specifically, counted from the right hand side of the picture, are 
deleted. A received DVB program has an unknown bitrate as well as an unknown picture size. 
For all possible signal situations, the transcoding process should work properly. Macroblock 
truncation is a possible step that can be applied in critical situations, as data reduction method, 
as well as that it can be used to allows for a better subjective picture quality. 

The main task of the transcoding process is to generate a valid MPEG video 
elementary stream. In a specific application, the maximum number of bits per macroblock is 
4608 bits. If such a stream enters the transcoding system, the chance that a transcoded picture 
fits into the target bitcost, is small. For this situation, macroblock truncation offers the 
possibility to strongly reduce the bitcost of the incoming intraframe encoded pictures. Because 
a picture of one macroblock per slice is a valid picture, the transcoder can give the guarantee 
that under all circumstances a valid video stream can be generated. 

Macroblock truncation can also be used to enhance the subjective video quality. 
This is done by deleting the last 5 or ten macroblocks or even more of all slices. The bitcost 
that would be used to transcode the deleted right hand portion of the picture, can now be spent 
on the remaining portion of the picture. Due to the fact that the decoder performs an 
upsampling in horizontal direction, the viewer will still have a full screen video. Because 
upsampling in horizontal direction is performed by the decoder, the shape of the objects in the 
spatial area are stretched horizontally. This can become annoying if too may macroblocks are 
deleted. 

Next, the generation of six trick play video signals will be described. 

The example of the track select trick play system described above supports six 
different trick play speeds, +/4, +/- 1 2 and +/- 24. All those trick play speed have their own 
trick play areas on tape, which form a virtual channel that during record will be filled with a 
video trick play transport stream. In order to prevent the implementation of six different I- 
frame transcoders, re-usability of trick play video will be performed. Re-usability is enabled 
by the fact that a fixed bit cost per GOP is used. The supported trick play speeds have a 
common dividend. The video trick play which corresponds to speed +/- 12 and.+/- 24 can both 
be deducted from speed +/-4. Figure 1 1 indicates the relation between the pictures for the 
different trick play tape speeds. 

When video trick play information is extracted, from the +/- 4 times video trick 
play speed, in order to generate video trick play for higher tape speed such as +/- 12 and +/- 24 
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^v.deo«okp %asweU . b ^ toWe ^^^ e0 ^^« 
video needs to be swapped. F igure , 3 Mcatts ^ 

N ^ a ' J ™ M ™P'™™'«»nof 1 ow^tovideo«ekp l aywillbe g iv«n.In 
^p^ SV« above, tbe vide, e.en,e n ,^ s ^p„ c ^ ng ^ bra . n 

o^toobto.nu.ebes,^.,^^^,^^^^ 
JirSZ, " , " aC '" :aI fm '" ana " ati< "' *—-■«•>—■.«-*. description 

The trick play channel bandwidth that is available for video trick play « 

nr , 2r aml " dis ^ ta " eib * eOTted ^ pMM ---™'-^n 

ntade bat the transport stream ovedtead is 5% of the totol bit rate. In the following 
deacnptton, the exact transport stream ovethead will be calculated 
For toe equation of the temsport abeam ovamead , disnncnm, can he mad. b«we«, me 

Z!t SUPP0,te4 D ° C to ^ **' WD re ™'» a °-. 30 Hz and 

29.97 Hz are aupported, die supported flam, rate* can bedepictod at theaupportrf scrumer 
rate. Table 20 indicates the supported record modes. 
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Frame rate 


Record and play back scanner rate 


25 Hz 


30 Hz 


30 Hz 


30 Hz 


29.97 Hz 


29.97 Hz 





Table 20: Supported record and play back scanner rate modes 



10 



15 



Table 20 indicates that there is aperfect fit for the 30 Hz frame rate situation 
and the 29.97 Hz frame rate situation. Each revolution of the scanner is equal to one display 
period. The GOP length N used for the video trick play elementary stream is equal to three 
This means that for the 30 Hz frame rate as well as for the 29.97 Hz frame rate the signal is 
periodic with 3 frame periods, or in other words three revolutions of the drum. As a 
consequence of this periodicity and the fact that 51 transport stream packets are recorded per 
revolution, the video trick play GOP as defined above can be depicted at 153 transport stream 
packets. Because of this situation an exact I-frame bit cost can be calculated. To calculate the 
I-frame bit cost all the required packets to transmit one GOP are depicted in figure 14. Table 
21 indicates the occurrence of each packet type. The smallest period interval to record a 25 Hz 
frame rate corresponds to 18 revolutions. Within this period 15 frames are stored, which is 
equal to 5 GOP's. 

With aid of the GOP layout at transport stream level depicted in figure 14, the 
total amount of transport stream overhead can be calculated. In table 21 the amount of 
transport stream overhead is depicted for the three frame rate situations. The characters 
a,b,c,d,e,f,g,h,i j and k in table 21 correspond to the characters used in figure 14. 
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Packet 
type 


Amount of ie\ 


rotations - 


= 3 


Amount of revolutions = 18 




Scanner rate/ 
Frame rate 
30 Hz 


TS 
Over- 
head 
(byte) 


Scanner rate/ 
rrame rate 
30*1000/100 
1 tiz 


TS 
Over- 
head 
(byte) 


Scanner rate 
= 30 Hz 
Frame rate 
= 25 Hz 


TS 

Overhead 
(byte) 


a 


1 


188 




188 


5 


940 


b 


1 


188 


1 


188 


5 


940 


c 


1 


27 


I 


27 


5 


135 


d 


72 


288 




288 


430 


1720 


e 


1 


12 


i 
1 


12 


5 


60 


f 


71 


284 


71 


284 


435 


1740 


S 


1 


27 


l 


27 


5 


135 


h 


2 


8 


2 i 


8 


10 


40 


i 


1 


27 


1 


27 


5 


135 


j 


2 


8 


2 


8 


10 


40 


k 


0 




0 




3 


564 



Table 21 : Transport stream overhead for three different recording modes. 
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rate a perfect fit of transnnr.*^ , *** 18 e « usU to manner 

to obtain periodicity The reason for * ' StUfBng k P erfo ™<* 

Packet k is only available after . ' wJP structure at transport stream level. 

overhead will be explored. am bandwidth, the transport stream 

Next, the bandwidth of the video elementary stream will be discussed 
A transport stream consist of packets with a fixed packet length of 1 8* hvt A 

distinction can be made between Melee* tw ♦ • Je ngtn of 188 bytes. A 

video infonnationa,dPES^ 

imanon andPES information, and packets that contain demultiolex inf^^ 

P-gramSpecificInformationCPSI.Thepackets^ 

PSI or video information or video «nrf PBS - f 8 * Contsun e,lher 

by extracting the non^ZT """""^ bandWid,h «» <» 

non vtd J? ■**• *» ,he "™ iUb 'a «*"»! bit rate Tie 

non vtdeo eatery te defilMld by ^ ^ ^ ^ «*• 

■ The PAT packet I 

■ The PMT packet 

■ The main ts header 

■ The adaptation field 

■ The PES header 

uuetotbts the 1-fhnneb.t cost,, slightly higher than the values finm table 9 - - 
PmtmOTtadtae »'»i>ba30H Z seanneran<i30Hzftamer.tewinhe 

paZnl,ltrT^^^ by ~^~° f ^^ 
Pacta ftta e» be ste^i per revohtdon. which is 5,, thnes the of ^ 

pcnod, w »,ch B ttn«e.™sr«uIteh ll 53 1 ^ P o^ p ^^3^ luta " S,KrGOP 
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Subtracting all the non elementary stream data from the 153 transportstream packets results i 
a video elementary stream bit rate of 2131920 bits/s. With a GOP structure of IPP, where by 
the P-frame have a fixed bitcost of 350 bytes, 30 slices times 1 1 bytes per slice plus 20 bytes 
for the picture header and picture header extension result in a I-frame bitcost of 207592 bits 
5 per picture. 

Next, an embodiment with a 29.97 Hz scanner and 29.97 Hz frame rate will be 
discussed. The bit rate calculation is almost equal to that of the 30 Hz scanner- and 30 Hz 
frame rate situation. The video elementary stream bit rate is 2129790.209 bits/s. The I-frame 
bitcost is not effected by 0.1% variation so with a GOP structure of IPP, where by the P-frame 
10 have a fixed bitcost of 350 bytes, 30 slices times 1 1 bytes per slice plus 20 bytes for the 
picture header and picture header extension result in a I-frame bitcost of 207592 bits per 
picture. 

Finally, an embodiment with a 30 Hz scanner and 25 Hz frame rate is 

discussed. 

15 The calculation for this situation is slightly different with respect to the previous two 

situations. Again the channel rate can be defined by multiplying the amount of transportstream 
packet that can be stored per revolution, which is 51, times the amount of revolutions per GOP 
period, which is 18. This results in 918 transportstream packets per 18 revolutions. Subtracting 
all the non elementary stream data from the 153 transportstream packets result in a video 

20 elementary stream bit rate of 2198266.66 bits/s. With a GOP structure of IPP, where by the P- 
frame have a fixed bitcost of 416 bytes, 36 slices times 1 1 bytes per slice plus 20 bytes for the 
picture header and picture header extension result in a I-frame bitcost of 257136 bits per 
picture. 

Next, the signal processing to generate a valid MPEG transport stream for 
25 recording that comprises a normal play transport stream component and a trick play transport 
stream component, will be described hereafter. 

A block diagram of the required signal processing blocks is depicted in figure 
15. The first signal processing block in figure 15 is the transport stream demultiplexer. This 
block extracts the video elementary stream from the multiplexed transport stream. The 
•30- information required to perform this operation is the video PID of the video elementary 

stream This information can either be obtained by parsing the PSI or can be delivered by other 
parts of the recording system. In case of PSI usage, special packets are parsed to obtain the 
required video PID. The first packet that is parsed is the PAT packet which has PID ='0'. This 
packet contains the PMT PID. This PMT PID, which is a customer defined value, carries the 
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video PID which is also a customer defined value. For detailed information see figure 4, and 
ISO/IEC 13818-lIncasethatmereisonlyonep^ " 
trick play is unambiguous. When there are more program such as for e. g . multi camera, then 
an arrangement is required to define on which program trick play is performed. A possible 
solution maybe that in case of multi program, trick play is generated for the first- or the last 
program in the PAT table. 

After extracting the video elementary stream from the multiplexed transport 
stream, I-fiame extraction is carried out. A video decoder can only start decoding when a 
sequence header, in case of MPEG-1, or a sequence header and a sequence extension, in case 
of MPEG-2, is received. For this reason, the sequence header and sequence extension are 
stored in memory. For those situation where a new GOP is not succeeded by a sequence 
header and a sequence extension, the in memory stored sequence header and sequence 
extension are inserted before sending the GOP header. The purpose of this insertion is to 
enable the video decoding process to start as quickly as possible after a switch from normal 
play to trick play. The stored sequence header and sequence extension header are updated each 
tare a sequence header and sequence extension header is received. This is important because 
the quantizer field may have been changed. The other field in the sequence headers must 
remain the same value for the whole video sequence. The next header that should follow the 
sequence layer is the GOP header. All extensions start codes that follow the sequence 
extension are ignored. This is done because they are not required by the video decoding 
process and only consumes bits that are necessary for the transcoding process. After the GOP 
header, the picture header should be received. Extensions that follow this header beside the 
picture coding extension and the quant matrix extension will be ignored. Up to now, all the 
filtering can be done on a byte basis. So far the following headers are parsed and necessary for 
the transcoded I-frame. 



■ Sequence header 

■ Sequence extension 

■ GOP header 

■ Picture header 

■ Picture coding extension 

■ Quant matrix extension 
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After the quant matrix extension, if available, otherwise after the picture coding 
extension the slices are received. These units contain the compressed video data. Slices canbe 
detected by parsing the video elementary stream on byte basis. From here on variable length 
decoding will take place, and the in chapter five described transcoding process is perforined. 
5 When the selected I-frame is reduced, a valid elementary stream must be 

obtained. For this reason so called empty P-frames are added to the reduced I-frame in order to 
have the correct frame rate. The P-frame horizontal size depends on the horizontal size of the 
original I-frame size. The amount of variation is limited because the maximum horizontal size 
which is maximal 720 pels in some MPEG applications. An empty P-frame must always 
10 contain the first and last macroblock of a slice, this is required by MPEG. The macroblocks 
that are in between the first and the last macroblock are skipped. This is way such frames are 
called empty P-frames. 

Before the video elementary stream is converted into transport stream packets 
with a length of 1 88 bytes, a Packet header is added to the individual pictures that build up the 
15 video elementary stream. The Packetized Elementary Stream (PES) consists of the individual 
pictures that build up the video elementary stream. The only difference is that a header is 
attached to each compressed picture which canies information such as Decoding Time Stamp 
(DTS), Presentation Time Stamp (PTS), DSM_trick_mode_flag etc. For more details see 
ISO/IEC 13818-1. The DTS controls the video decoding process and the PTS controls the 
video presentation process. These two time stamps form the second way to perform the 
decoding process. The first way is by using the VBVdelay that is available in the elementary 
stream picture header. The time base, Program Clock Reference (PCR), which is used for the 
decoding process is transmitted by the transport stream. The DTS and PTS are unique points 
on the PCR time axes. For an example see figure 16 

The generation of DTS and PTS can be done by just incrementing the DTS and 
PTS value by one frame period. The example given below described the calculation of this one 
frame period value. 



20 



25 



30 



Example: 

Frame period = 40 ms,.(25 Hz frame rate) 

System clock = 27 Mhz 

Amount_of_27Mhz_cycles__per_framejperiod = (Frame period * System clock ) 



1080000 =M0e-3 *27e6 
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The DTS and PTS have a resolution based on a 90 Khz clock. For this reason 
the Amount_oU7Mhz_cycles jer.fiamejeriod must be divided by 300. This division 
results in a frame period value of 3600. 

So when three frame period are required to transmit one I-frame and two empty 
P-frames, then the initialisation value for DTS and PTS becomes: 

DTS = 3* 3600 =10800 
PTS =4* 3600= 14400 

These values are depicted at a 33 bits wide field in the PES header. 

The initialisation value depends on the amount of time that is required to transmit the first 
pxcture to the decoder. This depends on me VBV.delay and the time consumed, extra delay 
by the multiplex process. 

The transport stream multiplex operation, multiplexes the packetized video 
elementary stream and the required Program Specific Information (PSD. For this purpose the 
packehzed video elementary stream is divided over K transport stream packets. Hereby is K 
the number of packets required to transmit one packetized video elementary stream picture 
W,th aid of table 21, the value of K can be calculated for the three supported recording 
situations. 

The transport stream layer takes care of several system aspects. The following 
system aspects are minimally required to create a transport stream that can be decoded by a 
transport stream decoder: 

■ Synchronizes the decoder time base to the encoder time base 

■ Contains a mechanism to deal with corrupted data 

■ Contains a mechanism to deal with time base discontinuities 

■ Indicate a random access points 



Video trick play, regardless the trick play speed, can be seen a video sequence 
and normally with a finite duration. Such a video sequence has a time base, the temporal 
direction, on which at regular time intervals, usually the frame period, pictures are decoded 
and presented at a display, see figure 16. The time base at the decoder side must be locked to 
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that of the encoder in order to prevent a drift of the audio visual information. For video trick 
Play there can only occur a drift in the video decoding and presentation process. To lock the 
decoder time base to that of the encoder time base a Program Clock Reference (PGR) is send 
to the decoder at regular time intervals. MPEG as well as DVB have put constraints on this 
parameter. Table 22 below contains the recommended refresh values for several transport 
stream parameters. 



Parameter name 


According to ETS 290 


Min 


Max 


PAT 


25 ms 


0.5 s 


PMT 


25 ms 


0.5 s 


PCR 


Oms 


0.04 s 



10 



15 



20 



Table 22: Packet distance for PAT.PMT and PCR packets 

Transport stream packets that are corrupted due to transmission errors can 
disturb the decoding process. They can for e.g. cause a pipeline error in the video elementary 
stream decoder. For the D-VHS system an error correction system is available that is capable 
of correcting most of the errors that occur during the read process. Packets that are corrupted 
and can not be corrected by the error correction system can either be flagged by means of a 
transporter Jndicator, or can be removed from the multiplex. A transport stream decoder 
can disregard packets which transporter ^indicator flag is active, this will increase the 
robustness of the decoder system. The missing video data is replaced by video data from the 
previous decoded picture. This process is known as concealment. 

In situations where there is a switch between normal play and trick play, the 
time base will make a jump. A transport steam decoder will react on such a jump by 
modifying its system clock caused by the PCR value of the new transport stream. This will 
lead to undefined situations in the transport stream decoder. It requires a certain time before 
the decoder will recover from this time base discontinuity and start working properly again. 
This recovery must be initiated by the decoder. This means that the decoder has to monitor its 
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behaviour to detect such situations. In onlertop^ 

can be made active each time a switch is made between nonnal p, ay and trick play or trick 
Play and nonnal play. Beside activating the discontinuity flag the data that refer to the 
prev,ous time base must not arrive at the input of the decoder and the first new data that 
5 should arrive at the decoders input must contain a random.accessjndicator, for detailed 
information see ISO/IEC 13818-1. 

The transport stream will be transported across a medium (e.g a record carrier). 
The delay of this medium must be equal for each transport stream packet. If this is not the case 
than tt is possible to corrupt the decoding time base. This is caused by the feet that some 

10 transport stream packets which contain the PGR value will take more time to arrive at the 

transport stream decoder input. The time sample taken at the encoder side, the PGR value will 
be used m the decoder to synchronize the local decoder time base. An extra transmission delay 
wrll cause jitter on the 27 MHz decoder clock. The maximum jitter that is allowed is defined 
by the ISO/IEC 13838-1, 2.4.2.1. Figure 17 indicates this process. 

15 The distance in time between two succeeding transport stream packets that 

contain a PGR value, should be fixed. This means that the -time which can be calculated with 
aid of the PGR value in the two PGR packets, and the time elapsed by the transmission should 
be equal. For the situation as depicted in figure 17a, the elapsed time should be 40 ms The 
situation is figure 1 7b can cause problems if the -time is large enough and fell outside the 

20 allowed jitter range. 

A storage device can also be treated as a transmission channel. If only a 
recording is performed, then the delay is infinite. This will normally not be the case. At play 
back, the timing between succeeding transport stream packets must be in such a way 
reconstructed that it becomes equal to the timing between succeeding transport stream packets 
25 as they arrived at the input of the storage device during record. For this purpose a process 
called time stamping is performed. 

Normal play time stamping in D-VHS, is a mechanism that attaches a time label 
based on the 27 Mhz clock which is locked to the incoming transport stream, to each incoming 
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„, w lllvulllllie uaiispon stream, to each incoming 

transport stream packet The time label is referenced to the time duration of one revolution, in 
case of trick play, or three revolutions,' in case of normal. play, of the drum. Duringplay back 
the timing between succeeding transport stream packets can be reconstructed with aid of this 
time label. Once a time label is attached to a transport stream packet, this packet can be 
manipulated in various ways. One important aspect of this manipulation is that the position, of 
a transport stream packet may be changed. Figurel 8 indicates this manipulation. Changing the 
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position does not mean the order in which should arrive at the decoder input. A manipulation 
such as depicted in figurel 8, occurs for example when the incoming transport stream has 
temporarily a higher bit rate than the D-VHS channel bit rate. Packets will be smoothed in 
time in order to store them on tape. Another situation occurs when trick play is added to the 
5 tape format. When there is no trick play, the normal play transport stream packets will be 

stored on a position within the tracks corresponding to the calculated position. This calculation 
is done with aid of the attached time stamp, see D-VHS system standard paragraph 2.4.2.1 and 
2.4.3 for details. When this calculated position is not free in e.g. when it is occupied by trick 
play data, than the normal play transport stream packet is shifted to the first free syncblock 
10 area. 

Due to the attached time stamp value, the original timing of the transport stream 
can be obtained during play back. 

For trick play there is also a time stamp mechanism. The mechanism works 
exactly the same as in normal play. Reference is made in this respect to earlier filed 

15 international patent application IB98/00131 (PHN 16614) for details. The difference between 
the trick play transport stream and the nonnal play transport stream is that the normal play 
stream can be a fixed bit rate stream and that the trick play stream is a fixed bit rate stream, 
when generated in the way described above. A fixed bit rate is defined as a transport stream 
whereby the transport stream packets have a equidistant distance on the time axis. In a variable 

20 bit rate transport stream, succeeding transport stream do not have a equidistant distance on the 
time axis. Because there are 51 transport stream packets read during one revolution of the 
scanner during trick play, time stamping becomes a simple process. For the software generated 
trick play transport stream, which is not a real time process, time stamps can be calculated by 
means of linear interpolation. The PCR values in the transport stream which contain samples 

25 of the real time 27Mhz encoder clock, can be used to generate the time stamps required for 
trick play transport stream recording. Linear inteipolation does not only deliver the exact time 
stamp values for those transport stream packets that contain the PCR fields, but also for the 
transport stream packets that lay in between the two PCR packets. This last phenomenon is 
caused by the fact that the transport stream has a fixed bit rate whereby the transport stream 

-30 - packets have a equidistant.distance. 

The following can be concluded Low-end video trick play based on the track 
select system has the potential of providing a subjective video quality that can be ranked from 
acceptable up to good. The trick play signal processing algorithm is a transcoding algorithm, 
based on selecting runlength-level encoded AC coefficients from selected normal play intra ' 
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frame MPEG encoded pictures. To suppress the amount of clearly visible artefacts, that ocoyr 
when only a small number of AC coefficient per DCT block are selected, the AC coefficient 
selection process depends on the value of the differential DC of that DCT block. This method 
will reduce the amount of visible artefacts that occur at edges. 
5 Although the amount of intra frame encoded pictures that can be transmitted to 

the receiver side is smaller than the supported frame rate a normal transport stream decoder 
can be used to display the decoded video trick play stream. Key factor hereby is the so called 
empty P-frame. With aid of such a picture*, a valid MPEG video elementary stream (valid, 
with respect to the frame rate) can be generated. This is important because all the provisions 
10 that are available to take over the video decoder control are a manufacturer's option. This 
means that there is no guarantee that a decoder control can be realized. Trick play based on 
repetition of intra frame pictures may lead to interlace disturbance. Such a situation will occur 
when the original video was not progressively scanned. With the aid of flags in the transport 
stream layer the video decoder can be forced into field repeat mode. But again this is a 
15 manufacturer's option. Trick play, based on I-frame selection, offers the advantage that the 
generated video trick play stream can be used for reverse trick play as well due to the fact that 
beside the interlace information there is no temporal information in the video information. By 
means of temporal sub-sampling in the MPEG compressed domain, high video trick play 
stream can be generated. Due to the re-usability of transcoded video trick play data there will 
20 only be one video transcoder to generate all the required video elementary trick play streams. 

Normal play channel stuffing is based on insertion of dummy syncblocks. Trick 
play channel stuffing based on syncblock stuffing makes trick play generation for reverse 
speeds unnecessarily complex. For this reason trick play channel stuffing based on dummy 
syncblocks has been abandoned and transport stream stuffing is introduced. Although 
25 transport stream stuffing requires two syncblocks, the trick play system complexity is reduced 
considerably. Due to the fact that two of the three supported frame rates can be mapped at the 
scanner rate, channel stuffing only occurs for the 25 Hz frame rate. 

Due to a fixed trick play transport stream mapping, time stamping becomes a 
signal processing step with less complexity. 

30 " — Next f -an apparatus of the helic^ 

information on a longitudinal record carrier, is described. Figure 19 shows the recording 
apparatus which comprises an input terminal 1 1 1 for receiving a video signal and a 
corresponding audio signal. The video signal and the corresponding audio signal may have 
been encoded into transport packets included in an MPEG serial datastream, well known in the 



WO 99/65239 PCT/IB99/0101S 

43 

art. The input terminal 1 1 1 is coupled to an input 1 12 of a 'normal play* processing unit 114. 
Further, a trick play' processing unit 1 16 is provided having an input 117 also coupled to the 
input terminal 111. Outputs 1 19 and 120 of the 'normal play' processing unit 114 and the •trick 
play 1 processing unit 1 16 are coupled to corresponding inputs of a multiplexer 122. The 
5 formal play 1 information as well as the 'trick play* information will be recorded in the tracks 
on the record carrier 140. 

For a further description of the 'normal play' processing unit 1 14 and the 'trick 
play' processing unit 116, reference is made to EP-A 702,877 (PHN 14.818). 

A subcode signal generator 124 is present for supplying the subcode signal 
10 information for storage in a subcode signal recording portion in the tracks on the record 
carrier. Outputs of the multiplexer 122 and the generator 124 are coupled to corresponding 
inputs of an error correction encoder unit 126. The error correction encoder unit 126 is capable 
of carrying out a error correction encoding step on the 'normal play' (video and audio) 
information and the trick play information, so as to obtain the parity information. 
15 The recording apparatus further comprises a generator 130 for adding sync and 

ID information. After combination of the signals in the combining unit 132, the combined 
signal is applied to a unit 134, in which a channel encoding is carried out on the composite 
signal. The channel encoding carried out in the encoding unit 134 is well known in the art. For 
an example of such channel coding, reference is made in this respect to US-A 5,142,421 (PHN 
20 13.537). 

An output of the channel encoding unit 1 34 is coupled to an input of a writing 
unit 136, in which the datastream obtained with the encoding unit 1 34 is recorded in the slant 
tracks on a record carrier 140, by means of at least two write heads 142 and 144 positioned on 
a rotating head drum 146. The write heads 142 and 144 have head gaps with a mutually 
25 different azimuth angle. Further, a time stamp generator 147 is available for generating the 
time stamps for the normal play processing unit 1 14 and the trick play processing unit 1 16. 

A microprocessor unit 148 is present for controlling the functioning of the 
various blocks, such as: 

- the control of the normal play signal processing block 1 14 via the control connection 150, 
— 30*" -~the control of the trick play signal processing block 1 J 6_via the control connection 152, 

- the control of the subcode signal generator block 124 via the control connection 154, 

- the control of the error correction encoding block 126 via the control connection 156, 

- the control of the sync signal and ID signal generator block 130 via the control connection 
158, 
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- the control of the channel encoding block 134 via the control connection 160, 

- the control of the transport velocity of the record carrier 140 and the rotation 'of the head ~ 
drum 146, via the control connection 162, and 

- the control of the time stamp generator 147 via the control connection 164. 
The trick play processing 1 16 is adapted to retrieve I-frame information from 

the first information signal, in the way described above. The trick play signal obtained for a 
specific trick play speed is accommodated in trick play sync blocks, for recording on the 
record carrier. 

Further, for each trick play information signal, trick play sync blocks are 
generated, in the sense that for each trick play sync block, a trick play speed identifier and a 
direction identifier are generated and stored in the trick play sync block and a time stamp is 
added to each packet in the various trick play information signals. 

Next, the trick play sync blocks and the 'normal play- sync blocks, generated by 
the normal play signal processing unit 1 14, are combined in the multiplexer unit 122. Subcode 
data is added and an error correction encoding is carried out on the combined normal play data 
and trick play data so as to obtain the parity information. Further, sync words and 
identification information is added Next, a channel encoding step is carried out on the 
information prior to recording the information in the tracks. 

Whilst the invention has been described with reference to preferred 
20 embodiments thereof, it is to be understood that these are not limitative examples. Thus, 

various modifications may become apparent to those skilled in the art, without departing from 
the scope of the invention, as defined by the claims. 

Further, the invention lies in each and every novel feature or combination of • 

features. 
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CLAIMS: 



1 



Apparatus for recording a digital video information signal on a record carrier 
the apparatus comprising 

- input means for receiving the digital video information signal, 

- trick play signal generating means for generating a trick play signal from said digital video 
5 information signal, so as to enable a trick play reproduction at a speed m times the nominal 

reproduction speed, where m is an integer larger than 1, 

- merging means for merging said digital video information signal and said trick play signal 
into a composite infonnation signal, 

- writing means for writing said composite information signal in a track on said record carrier, 
10 said trick play signal generating means being adapted to 

(a) retrieve intra encoded pictures from said digital video infonnation signal, 

(b) generate inter encoded pictures, 

(c) merge said intra encoded pictures and said inter encoded pictures, so as to obtain a trick 
play signal comprising subsequent groups of pictures, comprising an intra encoded picture 
followed by a number of n of said generated inter encoded pictures, where n is an integer ' 
larger than 0, the generated inter encoded pictures being such that, upon reproduction at said 
trick play speed, an inter encoded picture following an intra encoded picture results, upon 
decoding, in a repeated presentation of a picture obtained from decoding said intra encoded 
picture. 
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2. Apparatus as claimed in claim 1, wherein an inter encoded picture in said group 

of pictures has been obtained by an encoding step which is equivalent to encoding a picture 
which is a copy of the previous picture in the said group of pictures that led to the intra 
encoded picture in said group of pictures. 

3- Apparatus as~clahn"ed to claim" i or~2, "wherem ttVaiparatusls mn*er~adapted"~ 

to record a second trick play signal on said record carrier, said second trick play signal being 
meant for a reproduction from said record carrier at a second trick play speed p . m times said 
nominal reproduction speed, p being a constant larger than 1, said trick play signal generating 
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means further being adapted to generate said second trick play signal from said video 
information signal by selecting each p-th group of pictures from the first mentioned trick play 
signal, said merging means being further adapted to merge said second trick play signal into 
said composite signal. 

4. Apparatus as claimed in claim 3, wherein the apparatus is further adapted to 
record a third trick play signal on said record carrier, said third trick play signal being meant 
for a reproduction from said record carrier at a third trick play speed P . m . q times said 
nominal reproduction speed, q being a constant larger than 1 , said trick play signal generating 
means former being adapted to generate said third trick play signal from said video 
information signal by selecting each q-th group of pictures from the second trick play signal 
said merging means being further adapted to merge said third trick play signal into said 
composite signal. 

5. Apparatus as claimed in claim 1 or 2, wherein the apparatus is further adapted 
to record a second trick play signal on said record carrier, said second trick play signal being 
meant for a reproduction from said record carrier at a second trick play speed - m times said 
nommal reproduction speed, said trick play signal generating means further being adapted to 
generate said second trick play signal from said video information signal by reversing the 
sequence of pictures in each group of pictures in the first mentioned trick play signal, said 
merging means being further adapted to merge said second trick play signal into said 
composite signal. 

6. Apparatus as claimed in claim 1 or 2, wherein said trick play signal generating 
means is adapted to generate said groups of pictures in said trick play signal such that they all 
have a constant bit cost. 

7- Apparatus as claimed in claim 1 or 2, wherein said intra encoded pictures in 

said trick play signal are data compressed versions of intra encoded pictures comprised in said 
-4l^^.yi4 OT J°(°J n iation signal. 

8. Apparatus as claimed in claim 7, wherein said trick play signal generating 

means is adapted to data compress intra encoded pictures comprised in said digital video 
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information signal by lowering the resolution of said intra encoded pictures so as to obtain said 
intra encoded pictures of said trick play signal. 

9. Apparatus as claimed in claim 7, wherein said trick play signal generating 
5 means is adapted to data compress intra encoded pictures comprised in said digital video 

information signal by selecting a restricted number of AC coefficients of said intra encoded 
pictures so as to obtain said intra encoded pictures of said trick play signal. 

10. Apparatus as claimed in claim 9, wherein the trick play signal generating means 
10 is adapted to select a number of AC coefficients that may be variable for subsequent intra 

encoded pictures of said trick play signal. 

11 . Apparatus as claimed in claim 10, wherein the trick play signal generating 
means is adapted to select for the generation of an intra encoded picture of said trick play 

15 signal, for a current subpicture of said intra encoded picture, a number of AC coefficients from 
the corresponding subpicture of said intra encoded picture of said digital video information 
signal, said number having a relationship with the difference between the DC coefficient of the 
current subpicture of said intra encoded picture of said digital information signal from which 
the present intra encoded picture of the trick play signal will be derived and the DC coefficient 

20 of the previous subpicture of said intra encoded picture of said digital video information 
signal. 



12. Apparatus as claimed in claim 1 1, wherein said number increases for an 
increasing difference. 

25 

13. Apparatus as claimed in claim 1 > wherein the trick play signal generating means 
are adapted to 

(a) retrieve I pictures from said digital video information signal, 

(b) generate P pictures, 

30 — (c) merge said I pictoes_and said L P pictures so as to obtain said trick play signal comprising 
groups of pictures, comprising an I picture followed by a number of n P pictures, the generated 
P pictures being such that, upon reproduction at said trick play speed, a P picture following an 
I picture results, upon decoding, in a repeated presentation of a picture obtained from decoding 
said I picture. 
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M. Apparatus as claimed in claim 7, wherein said trick play generating means are 

adapted to data compress an intra encoded picture comprised in said digital video information 
signal by deleting subpictures of the said picture, starting at the right hand side of the picture 
towards the left hand side of the picture. 

15. Apparatus for recording a digital video information signal on a record carrier, 
the apparatus comprising 

- input means for receiving the digital video information signal, 

- trick play signal generating means for generating a trick play signal from said digital video 
information signal, so as to enable a trick play reproduction at a speed m times the nominal 
reproduction speed, where m is an integer larger than 1, 

- merging means for merging said digital video information signal and said trick play signal 
into a composite information signal, 

- writing means for writing said composite information signal in a track on said record carrier 
said trick play signal generating means being adapted to retrieve intra encoded pictures from ' 
said digital video information signal, and are adapted to carry out a data compression step on 
said retrieved intra encoded pictures so as to obtain data compressed intra encoded pictures in 
said trick play signal. 

16. Apparatus as claimed in claim 15, wherein said trick play signal generating 
means is adapted to data compress intra encoded pictures comprised in said digital video 
information signal by lowering the resolution of said intra encoded pictures so as to obtain said 
data compressed intra encoded pictures of said trick play signal. 

17. Apparatus as claimed in claim 15, wherein said trick play signal generating 
means is adapted to data compress intra encoded pictures comprised in said digital video 
information signal by selecting a restricted number of AC coefficients of said intra encoded 
pictures so as to obtain said data compressed intra encoded pictures of said trick play signal. 

18. Apparatus as claimed in claim 17, wherein the trick play signal generating 
means is adapted to select a number of AC coefficients that may be variable for subsequent 
data compressed intra encoded pictures of said trick play signal. 
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5 ZTll ^ -™P-iin 8 ^chn. of ^d in*. ^ £„*J 

5 said digital video infonnationsiawi saidmimk^i,, • , • 

between the DC coeffi • , , ^ ^ dnUmberhaVU,garelation5W P^ the difference 
between the DC coefficient of the current subpicture of said intra encoded picture of said 

tncicplaysigna.wiUhederivedandtheDCcoefficientoftheprevio. 

encoded picture ofsaid digital video information signal. ^ 

Z • ^ Am,2IZtUS 38 Cldmed ^ C,aim 19 > said number increases for an 
increasing difference. 

15 Lode Me *° d ° fr "°^ 
15 method comprising the steps of 

- receiving the digits! video mfotmation signal, 

- .WehpUy^notn^ digits, video info™*™ sign*, so ss „ enab te , 
tn* ptay sepnsttelon a, a speed n, dn.es <he nomine Ruction speed, where . is . 
integer larger than 1, 

20 - meting add digita, „deo inio^anon signa, and said trick ptay signs, into a composite 
lniormation signal, 

- writing said composite information signal in a track on said record carrier 
said tnck play signal generating step comprising the substeps of 

(a) retrieving intra encoded pictures from said digital video information signal, 
25 (b) generating inter encoded pictures, 

(c) merging said intra encoded pictures and said inter encoded pictures, so as to obtain a trick 
Play signal comprising subsequent groups of pictures, comprising an intra encoded picture, 
followed by a number of n ofsaid generated inter encoded pictures, where n is an integer 
^ertnanO the generated mter encoded^ 

-Jpck pjay spe_ed,.an inter encoded picture following an-intra encoded-picture results,-upon- 

decoding, in a repeated presentation of a picture obtained from decoding said intra encoded 
picture. 
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method comprising the steps of "=.mp 

- receiving the digital video information signal, 

^an^means fcrgeneranngatdctplaysig^l fiom saiddignal video information 

- writing said composite information signal in a track on said record carrier 
said tnck play signal generating step comprising the substeps of 

(a) retrieving intra encoded pictures from said digital video infonnation signal, and 

(b) canying out a data compression step on said retrieved intra encoded pictures so as to 
obtain data compressed intra encoded pictures in said trick play signal 



15 23 



23. RTOorfc ™^vmgadigitaIvideoiiiformation^ 
recorded on it. said digital video ior^on signal bemg meant for a ^roducjfrlm^ 
record earner at a nominal play reproduction speed, said trick play signal being meant for a 
reduction from said record carrier at a trick play speed m times said nominal reproduction 
speed, m being an integer larger than 1, said trick play signal comprising subsequent groups of 
Pictures, comprising an intra encoded picture, followed by a number of n of inter encoded 
Pictures, where n is an integer larger than 0, the inter encoded pictures being such that, upon 
reproduction at said trick play speed, an inter encoded picture following an intra encoded 
Picture results, upon decoding, in a repeated presentation of a picture obtained from decoding 
said intra encoded picture. 
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